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ABSTRACT 

We  address  two  problems  of  memoryless  distributed  detection  with  dependent  observa¬ 
tions  across  time  and  sensors.  In  the  first  problem,  the  observation  sequence  of  each  sensor 
consists  of  a  common  weak  signal  in  additive  dependent  noise  with  stationary  univariate  and 
second-order  joint  densities;  here  the  objective  of  the  sensors  is  to  cooperatively  detect  the 
presence  of  a  weak  signal.  In  the  second  problem,  the  observation  sequence  of  each  sensor  is 
characterized  by  its  stationary  univariate  and  second-order  joint  densities;  here  the  objective  of 
the  sensors  is  to  cooperatively  disreiminate  between  two  arbitrary  such  sequences  of  observa¬ 
tions.  For  both  problems,  the  analysis  and  design  are  based  on  a  common  large  sample  size. 
The  dependence  across  time  and  sensors  is  modeled  by  m -dependent,  <j>-mixing,  or  p-mixing 
processes.  The  performance  of  the  two-sensor  configuration  for  each  problem  is  measured  by 
an  average  cost,  which  couples  the  decisions  of  the  sensors.  The  design  criteria  for  the  test 
satisfies  of  the  sensors,  which  constist  of  sums  of  memoryless  nonlinearities,  are  established  by 
using  two-dimensional  Chemoff  bounds  on  the  associated  error  probabilities  involved  in  the 
average  cost.  The  optimal  nonlinearities  are  obtain  as  the  solutions  of  linear  coupled  or  uncou¬ 
pled  integral  equations.  Numerical  results  for  specific  cases  of  practical  interest  show  that  the 
performance  of  the  proposed  scheme  is  superior  to  the  one  which  ignores  the  dependence 
across  time  and/or  sensors  for  each  of  the  two  problems. 


This  research  was  supported  in  part  by  the  Office  of  Naval  Research  under  contract  N00014-89-J-1375  and  in  part  by  the 
Systems  Research  Center  at  the  University  of  Maryland,  College  Park,  through  the  National  Science  Foundation’s  Engineering 
Research  Centers  Program:  NSF  CDR  8803012. 


I.  INTRODUCTION 


Decentralized  detection  presents  several  interesting  problems  (see  [1]  and  [2]).  Classical 
theory  of  centralized  hypothesis  testing  cannot  be  applied  directly  to  practical  systems  of  dis¬ 
tributed  decision  makers  (or  sensors).  The  advantages  of  employing  distributed  sensors  include 
low  communication  bandwidth,  good  reliability,  and  low  cost. 

In  most  of  work  on  distributed  decision  making  (see  [1-5]),  the  observations  (signal  in  noise) 
are  assumed  to  be  independent  across  time  and/or  sensors,  an  assumption  that  is  intended  to 
make  the  analysis  tractable.  However,  the  observations  are  <mnorally  dependent.  Indeed,  the 
observation  sequences  of  the  sensors  become  dependent  across  time  individually  for  each  sensor, 
when  sampling  rates  increase,  and  correlated  across  sensors,  when  the  locations  of  the  sensors 
are  close  geographically. 

In  contrast  to  the  model  of  independent  observations,  distributed  detection  from  dependent 
observations  across  time  and/or  sensors  are  studied  in  this  paper.  In  particular,  two  sensors 
without  communication  collect  their  own  observations  n  times  (the  common  sample  size)  and 
decide  which  hypothesis  is  true,  Hq  or  H\.  The  object  of  each  decision-making  process  is  to 
minimize  an  average  common  cost,  which  consists  of  error  probabilities  and  couples  the  two 
decisions  in  general.  The  form  of  the  average  cost  is  concerned  with  the  consensus  of  the  two 
sensors,  and  thus  can  be  applied  to  the  data  fusion  with  a  fixed  rule  such  as  ’OR’,  ’AND’  or  the 
majority  rule. 

Two  problems  of  different  descriptions  for  the  observations  are  considered:  in  the  first  one, 
each  observation  consists  of  a  weak  signal  in  additive  dependent  noise  with  a  stationary  uni¬ 
variate  density  and  second-order  joint  densities;  in  the  second  one,  each  observation  itself  is 
characterized  by  its  univariate  and  second-order  joint  densities  and  thus,  not  necessary  for  the 
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signal  in  additive  noise.  For  both  types  of  observations,  the  analysis  is  based  on  a  common  large 
sample  size  for  each  sensor,  which  guarantees  a  high-quality  performance. 

The  model  of  dependence  used  here  is  described  by  the  mixing  sequences,  such  as  the  rn- 
dependent,  the  </>-mixing  or  the  p-mixing  sequences  (see  [6-8]).  These  models  of  dependence  have 
been  extensively  and  successfully  used  in  single-sensor  detection  (see  [9-11]).  In  general,  the 
optimal  detection  scheme  under  the  above  model  involves  high-order  (larger  than  two)  densities, 
which  are  difficult  to  characterize,  and  needs  huge  space  of  memory  for  storing  the  dependent 
data.  To  avoid  such  inconvenience,  we  employ  a  suboptimal  structure  of  memoryless  nonlinearity 
for  each  sensor  in  this  work  (see  [9]);  each  observation  of  sensor  k  (k  =  1,2)  is  passed  through  a 
nonlinearity  Pfc(-)  and  summed  up  to  the  sample  size  n  to  form  the  test  statistic  Tn>k{gk{-))-  The 
activity  of  each  sensor  is  to  conduct  a  threshold  test  (as  in  the  likelihood  ratio  test)  by  using  its 
test  statistic.  The  design  criterion  for  each  nonlinearity  is  maximizing  an  objective  function  of 
efficacy-type  induced  from  the  average  cost  by  using  the  two-dimensional  Chernoff  bounds  on 
error  probabilities  (see  [9]).  Each  course  of  obtaining  the  optimal  nonlinearity  involves  solving 
an  integral  equation. 

Although  the  scheme  of  this  paper  is  suboptimal,  numerical  results  indicate  that,  with  rea¬ 
sonable  large  sample  size,  the  performance  is  better  than  that  of  the  one  which  ignores  the 
dependence. 

The  remainder  of  this  paper  is  organized  as  follows:  in  Section  II,  we  discuss  the  central 
limit  theorem  for  the  m-dependent,  the  0- mixing  and  the  p- mixing  sequences  and  derive  the 
two-dimensional  Chernoff  bounds.  In  Section  III,  the  first  description  of  observations  for  a  weak 
signal  in  additive  noise  is  considered.  In  Section  IV,  the  second  description  of  observations 
is  considered.  In  Section  V,  numerical  results  from  Cauchy  noise  for  the  first  description  of 
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observations  (weak  signal)  and  from  Rayleigh  vs.  log  normal  (univariate  and  second-order  joint) 
densities  for  the  second  description  of  observations  (strong  signal)  are  given.  Conclusions  are 
drawn  in  Section  VI. 
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II.  PRELIMINARIES 


II.A.  The  Central  Limit  Theorem  for  Mixing  Sequences 

Let  {xfk\t  =  1,2,  be  the  observation  sequence  of  sensor  k,  ( k  —  1,2);  the  test 

statistic  of  sensor  k  is  given  by 

r»jb(m)  =  (2) 

4  =  1 

where  gk(-)  is  the  nonlinearity  to  be  determined.  The  test  of  sensor  k  is 

>Hx 

Tn,k  <Ho  nVk  (3) 

where  nrjk  is  the  threshold. 

As  mentioned  in  Section  I,  the  model  of  dependence  is  one  of  the  m-dependent,  ^-mixing  or 
p-mixing  sequences.  However,  the  model  is  an  issue  here,  instead,  it  is  essential  that  appropriate 
conditions  are  satisfied,  as  that,  with  a  large  sample  size,  the  test  statistic  (3)  of  each  sensor  is 
asymptotically  Gaussian  distributed.  To  give  an  example  of  such  “appropriate  conditions,”  let 
us  cite  the  central  limit  theorem  for  an  arbitrary  function  h  of  the  ^-mixing  sequences  (see  [6]). 

Theorem  1:  Suppose  that  {Ty is  a  stationary  <£— mixing  sequence  with  Y1JL i<t>]  <  oo 
and  that  h  is  a  measurable  function  satisfying 

E[h(Yi)]  =  p,  var[/i(li)]  <  oo 

Then  the  series 

m 

o\h)  =  nartMF)]  +  2  £  c  <W[fc(y1)fc(l'-+1 )]  (4) 

i= i 

converges  absolutely  as  m  — >  oo.  Furthermore,  if  a2  >  0 

4jE  HYJ-nf,] 

*  j= 1 

converges  in  distribution  to  a  Gaussian  distribution  with  mean  zero  and  variance  o2 . 
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Proof  :  Set  h  -  h  -  p0,  thus  E[h(Yi)]  =  0  and  P[h2(Yi)]  <  oo.  The  above  Theorem  follows 
according  to  the  theorem  on  page  174  of  [6]. 

In  the  following  sections,  we  omit  the  arguments  of  the  means  fik,i{‘)  and  the  variances  o k,i{-), 
which  are  functionals  of  and  thus  functions  of  the  observations.  The  m  in  the  formulations 
is  finite  for  m-dependent  noise  and  m  — >  oo,  for  ^-mixing  or  p-  mixing  noise. 

II.B.  Two-Dimensional  Chernoff  Bounds 

The  cost  function  used  in  this  context  has  the  form 

*  ' 

0  if  d\  —  d2  —  u 

ci  if  d\  ^  d2>u=0 

C(di,d,2‘,  u)  =  <  c'  if  dj  ^  rf2?u=l  (5) 

C2  if  d\  =  d2  J1  w, «  =  0 
c2  if  d\  =  d2  7^  u  =  1 

where  dk  ( k  =  1,2)  is  the  binary  (0  or  1)  decision  of  sensor  k,  u  is  the  true  phenomenon  (u  —  0 
or  1),  and  ci,  c'j,  C2  and  c'2  are  non-negative  constants.  We  also  assume  that  C2  >  Ci ,  c'j  and 
c2  >  ci,ci,  i.e.,  the  penalty  is  higher  when  both  sensors  make  wrong  decisions.  This  form  of  cost 
is  of  the  Bayesian  type  and  has  been  useful  in  [1].  The  average  cost,  which  couples  the  decisions 
of  the  two  sensors  is 

E[C(dt,d2]u)]  =  pE0[C(di,d2;u)]  +  (1  -  p)E1[C(d1,d2-,u)\ 

=  p{ca[Po(l,0)  +  Po(0, 1)]  +  c2P0(l,  1)}  +  (1  -  p){c[[Pi(  1,0)  +  Pi(0, 1)]  +  c2Pi(0, 0)}  (6) 

where  p  is  the  a  prior  probability  of  Ho,  Pi  (i  —  0, 1)  are  error  probabilities  under  Hi,  P;(l, 0)  = 
Pi(Tn> i  >  nt]UTna  <  ni)2), P(0, 1)  =  P,(Tn)i  <  nr]x,Tn<2  >  nr}2)  (i  =  0,1),  P0(l,l)  =  Po(Tn,i  > 
nm,T n,2  >  nrj2)  and  Pi (0,0)  =  Pi(Tn,i  <  nJh,T»l2  <  nr]2). 
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Because  of  the  complexity  of  the  average  cost,  there  is  no  directly  tractable  approach  by 
which  a  nice  criterion  can  be  reached  for  each  optimal  nonlinearity  ( k  =  1,2).  Instead 
of  optimizing  (6)  directly,  we  first  derive  the  two-dimensional  Chernoff  bounds  on  the  error 
probabilities  in  (6).  In  Sections  III  and  IV,  we  approximate  the  bounds  under  a  large  sample 
size.  Our  purpose  is  to  approximate  and  combine  the  terms  in  (6)  as  an  exponential  form,  i.e., 
E[C(di(gi),d,2(g2)]u)]  ~  A  •  exp  [J(  <71,  <72)],  so  that  we  have  a  closed  and  meaningful  form  of 
design  criterion  for  each  nonlinearity. 

Define 


Y(m,V 2) 


I  1  if  Tn>1  >  nr/uTn^  >  nrj2 


0  otherwise, 

where  we  assume  that,  for  k  =  1,2,  Tn>k  ~  N’{npk,i-,'n(r'li)  under  Hi,  ( i  —  0,1),  and  the  pair 
(Tn<i,Tni2)  is  bivariate  Gaussian  distributed  with  the  correlation  coefficient  pi  =  Ei[Tn<  1  •  T„)2]. 
Then  for  si  >  0,  s2  >  0 


<  e.i7i+»2Ta> 


(7) 


Taking  the  expectation  of  (7)  under  Hq  and  we  have,  after  some  manipulations 


Po(Tn,i  >  ni]i , T„(2  >  71772)  <  e-3^-a^Eo[eaiTn'i+SiT^] 

=  V2tt  exp 


nal0sl  7 xolpsl  , 

— - - 1 - - - f-  npQO\flQ2flS\s2  -  si 71(771  -  711,0)  -  s2n{r}2  —  /i2,o) 


Similarly,  for  t\  >0,t2>  0,  and  <714  >  0,q2,i  >  0  and  r\ti  >  0,r2)j  >  0  we  have 


.  (8) 


Pi(Tn>1  <  717?|, Tn> 2  <  nrf2)  <  et'nr"+t*nr* £1[e-<lT"-1-‘2r»-2] 
=  \/27rexp 


ncrf  na^isl 

— ^ - 1 - 2 - ^  nP\°h\°2,\SiS2  +  5171(771  -  7114)  +  s2 71(772  -  7/2,1) 


(9) 


under  H\, 


Pi{Tn,  1  >  n??l)^n,2 


<  7/772)  <  e-?1’'Tl?J1+,2’'nT?2£l,[e?1’’:7'n’1-92'’Tn’2] 
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=  V2tt  exp 


naul2<lli  +  n<7l^h  _  npm  i(y2tiquq2<i  -  -  Mi,i)  +  ?2,i»(»ft  Ma,i) 

2  2 


(10) 


under  Hi,  ( i  =  0, 1)  and 

m  \  S  z/1  iTITJl— T2  tTl»}2  r.[e-n,i7n,l+>-2,irn,2l 

Pi{Tn,x<nr]l,Tnfl>nr}2)<e *■*  Ale  J 

=  n/2tt  exp 


n(Ti,,ri,«  ,  lifjiillii  _  npi(J\,i<y2,\r\,\Ti,i  +  n,in(7?i  -  Mi,*)  -  ?'2,in(^2  M2,*) 


(11) 

under  Hu i  =  0, 1.  The  bounds  given  by  (8)-(ll)  supply  good  approximations  to  the  error  prob¬ 
abilities  in  the  average  cost,  as  the  sample  size  increases.  The  positive  constants  «,*a,»,i,r*.i 
are  to  be  adjusted  to  set  final  forms  of  the  bounds.  Since  the  weights  (c2,c'2)  of  Pod,  D  or 
Pl(0,0)  are  greater  than  that  (c.c',)  of  Pi( 0,1)  or  P,(1,0)  (i  =  0,1),  we  first  minimize  the 
bounds  in  (8)  and  (9)  with  respect  to  (*,  *)  and  «„*,),  respectively.  If  we  minimize  (8)  with 

respect  to  ($1,^2)?  we  have 


Po(Tn,i  >  nrtnTnfi  >  nrj2) 


In,  -  ,  (m  -  I‘wf  2p0(m  -  WbK m  -  m,o) 

- _ - 1 - ~o 


'1,0 


'2,0 


01,  002,0 


(12) 


and  the  corresponding  (si,s2)  are 


where  0  <  Mo  — 


(yi  —  fli.o)a2,o  ~  (V2  ~  M2,q)MoA 

Sl  =  (1  -  Mo)ai, 0^2,0 

(l?2  ~  M2,o)<71.0  ~  (l?l  ~  Ml,o)/)lCr2,0 

52  ""  _  (1  -  MoW.oA.o 

2  <  1.  Similarly,  if  we  minimize  (13)  with  respect  to  {h,t2),  we  have 


(13) 

(14) 


Pi(T„,i  <  m7i,Tn,2  <  nrj2) 


< 


_  f  n  [(Mi.i-m)2  (M2,1  -  m?  2mi(mu  -  i?i)(M2,i  JTg) j  \  (15) 

^exp\-2(l-7fj  ^  ct2,i  CTl-1<T2’1 
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and  the  corresponding  (<i ,  <2)  are 


-  ..  (7*1,1  -  m)o 2,1  -  (/* 2,1  - 

1  (!  -  Pl)^l,l2^2,l 

(16) 

_  (7*2,1  -  92)01,1  -  (7*1,1  -  m)pi^2,i 

(1  -  7>l)02,l20i,l 

(17) 

Under  the  restrictions  Sk  >  0  and  tk  >  0  (k  =  1,2),  we  have 


and 


9i  -/*i,o 

_  %  -  7*2,0 
>  7>o, 

Ol,0 

02,0 

92  -  7*2,0 

.  91-7*1,0 

>  Po, 

02,0 

Ol,0 

7*1,1  -  91 

7*2,1  -  92  „ 

>  _  pi 

02,1 

0-1,1 

7*2,1  -  72  _  7*1,1  ~  9i 

- > - />! 

02,1  ^l.l 


(18) 

(19) 

(20) 


(21) 


Let  us  now  consider  (10)  and  (11).  We  can  minimize  (10)  with  respect  to  (ri,,-,  r2,;),  or  (11) 


with  respect  to  (51  ,i,  92,0 •  However,  either  minimization  will  lead  to  results  contradictory  to 


(18)-(19)  or  (20)-(21),  if  the  constraint  >  0  or  >  0  (i  =  0, 1;  k  =  1,2)  are  to  be  satisfied. 
Therefore,  we  choose 


71  -  7*1,0 

r°d  _  ^2  ’ 

al,0 

1*2,0  =  0 

(22) 

P2,i  -  m 
^2,1  9  ’ 

ct2,1 

n,i  =  0 

(23) 

9i  -  7*2,0 
92,0  =  2  » 

02,0 

91,0  =  0 

(24) 

7*1,1  -  91 
9l,l  -  2  ’ 

Ol,l 

92,1  7  0 

(25) 

which  give  the  bounds  on  (10)  and  (11)  as 


Pi(Tn,  1  >  nrj-iiTnt  <  rn]2)  <  \/2irexp 


-»(??i  -  7*i,t)2 


Ki 


(26) 
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and 


Pi(Tn> i  <  nr) i,T„i2  >  nr)2 )  <  V2nexp 


-n(r/2  -  M2,;)2 


2^I,  « 


(27) 


In  the  above  bounds,  all  parameters  and  qkti  (t  =  0, 1  and  k  =  1,2)  are  nonnegative. 

The  constraints  satisfied  by  r)k,Pk,i  and  ak,i  are  described  by  (18)-(21)  and 


Pk,o  <Vk<  Mfc, l 


(28) 


for  k  =  1,2. 

Finally  the  bound  on  the  average  cost  has  the  form 


E[C{dud2-,u)}  <  Bu 


—  p{c\  |\/27r  exp 
-f  c2\/27rexp 


■n(r)i  -  pii0)2 


r 


2cti,o 

n 

WH) 


-f  \^Texp 
\2 


-n(rj2  -  /i2,o)2 


H,o 


{Vl  -  Pi, o)2  ■  (*?2  -  /r2,o)2  2p0(j?i  -  Mi,o)(f?2  -  M2,o) 

9  I 


'1,0 


+(1  -  p){c\  |\/2^exp 

+C2\/27r  exp  { 


- n(m  ~  Mu)2 


Hi 


+  y/Hrexp 


#2,0  01,002,0 

-n(r)  2-P2,i)21 


Hi 


n 


{  2(1  -pj) 


{m-Pi,iY  ,  {i2-P2,i)2  %pi{vi  —  Pi,i){r)2  —  P2,i) 
2  „2 


'1,1 


'2,1 


01,102,1 


■} 
(29) 


For  the  purpose  of  convenience,  in  the  following  sections  we  will  set  c\  =  c\  and  c2  =  c'2, 
which  can  be  applied  to  the  data  fusion  with  a  ma  jority  fusion-rule.  For  other  fusion-rules,  the 
techniques  in  the  following  sections  remains  useful  with  minor  modifications. 
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III.  WEAK  SIGNALS 


We  consider  tlie  following  observation  model  of  a  weak  signal  in  two-sensor  detection 

:  XM  =  Nt(k) 

H[k)  :  Xi(k)  =  6  +  N<k\  i=l, k  =  1,2  (30) 

where  0  — »  0  with  6  >  0  is  the  nonrandom  weak  signal  and  {N^k\i  —  1,  ■  •  *,n},  ( k  —  1,2)  are 
two  noise  sequences  with  stationary  univariate  and  second-order  joint  densities.  The  asymptotic 
analysis  based  on  a  large  sample  size,  which  guarantees  a  high-quality  performance  for  9  — ►  oo, 
is  used.  For  this  model  of  observations,  we  assume  that 

Eg[9k]  <  oo,  vare(gk)  <  oo  (31) 


and 

m 

°l,e(9k)  =  varM[</A(*,-fc))]  +  2  covM[^(^-fc))5fe(^j+i)]  >  0  (32) 

i=i 

for  all  0  >  0.  In  addition,  in  this  description  of  observations  we  use  the  notations 

°l,e{9k)  =  (33) 

npe(9i,92)  =  npi(gi,g2)  =  Ee[(TnA  -  nplti)(Tna  -  np2,i )]  (34) 

and 

Pk,e{9k)  =  Pk,\{9k)  =  J 9k{x)fk{x  ~  9)dx  (35) 

for  0  >  0,  where  fk(-)  the  univariate  density  of  and  thus  of  xjk ^  under  Ho. 

As  discussed  in  Section  II,  conditioned  on  Hi  for  7  =  0,1  and  0  >  0,  the  test  statistics 
Tn,k  of  each  sensor  are  asymptotically  Gaussian  distributed  with  means  npkye(gk)  and  variances 
no\  e(gk).  Furthermore,  Tn< i  and  Tn< 2  are  bivariate  Gaussian  distributed  with  the  correlation 
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coefficient  np$(gi,g2).  In  addition,  the  following  regularity  conditions  are  assumed,  for  k  =  1,2 
and  0  — ►  0,  under  a  large  sample  size 


J 9k(x)fk(x  -  6)dx]  =  J  gk(x)fk(x  -  6))dx , 

Hk,o(9k)  -  Hk,o{gk)  dpeigk)^ 

- -9 - *  — dT~]e=°  >  °’ 

0k,e{gk )  -*■ 


(36) 

(37) 

(38) 


and 


Pb(9i,92)  Po(9i,92)- 


(39) 


Since  9  — ♦  oo,  we  use  ak,o  and  po  to  represent  the  variance  and  the  correlation  coefficient  under 
either  hypothesis  in  the  following  Sections. 

III. A.  Dependence  Across  Time 

First  let  us  consider  the  case  when  the  two  observation  sequences  are  independent.  The 
locally  optimal  test  of  each  sensor  as  shown  in  [1]  is  a  likelihood  ratio  test  with  a  data-dependent 
threshold.  However,  the  computation  of  the  optimal  thresholds  is  generally  complicated  and 
depends  on  high-order  (larger  than  two)  densities  of  the  observation  sequences  that  are  difficult 
to  characterized.  Here  we  develop  a  suboptimal  approach  by  using  the  central  limit  theorem 
and  the  bounds  derived  in  the  Section  II.  The  threshold  of  each  sensor  for  design  purpose  is  easy 
to  determine  and  depends  only  on  the  means  of  the  nonlinearity. 

Since  the  correlation  coefficient  po  in  Section  II  is  zero  in  this  case,  the  bound  of  (29)  in 
Section  II  has  the  form 


Bu  —  p{ciV^7rexp 

-j-C2\/27rexp 


-n(g i  -  pi,o) 


2(7i,o 


-f  civ/27rexp 


-nim-p  2,o)2 


2°2,0 


-n(r?i  -  ^i,0)  -n(r)2  -  P2,o) 


2cti,o 


2<72,0 
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+(1 -p){ci\/27rexp  n(7?i  +  ciV&rexp  ~— r>~-2—2'  —  - 

^1,0  J  ^a2,Q 

+  (40) 

zal,0  Z(T1,0 

where  pk,o  <  i]k  <  Mk,i-  Notice  that  there  are  two  parts  in  (40):  one  is  from  the  error  probability 

under  Ho  and  the  other  is  from  the  error  probability  under  Hi.  Using  the  fact  that  for  positive 

constants  (ai,&i)  and  negative  constants  (<12,62) 

aien“2  +  bxenb>  »  2aibie^+nh^2  (41) 


as  n  increases,  we  can  approximate  the  bound  in  (40)  by 


l  8  alo  aio 


b2,o)2  1 

,0  J 


+(1  -  p)Aex  p  j  r|2 

o  O',  n  O' 9 


r2]) 


where  yl  is  a  known  constant.  Because  the  optimal  thresholds  are  difficult  to  calculate,  we  use 
the  suboptimal  thresholds  determined  in  an  ad  hoc  way.  Since  the  error  probabilities  cannot  be 
known  a  priori,  we  pose  the  condition 

(yi  -  Ml,o)2  ,  0? 2  -  M2,o)2  _  (m  -  /^l,g)2  ,  (??2  -  M2,g)2  /  ,n\ 

9  •"  9  ”  9  '  9  V™/ 


for  all  pk, 0,  Vk,e  and  ak,o  {k  =  1,2)  to  balance  the  two  error  probabilities  under  two  hypotheses 
in  (40),  so  that  we  do  not  bias  either  hypothesis,  One  solution  of  (43)  for  (771 ,  t]2)  is 

m  =  t=1>2.  (44) 


It  is  easy  to  check  that  the  constraints  (18)-(21)  and  (28)  are  satisfied  by  using  the  above 
thresholds.  The  bound  (42)  employing  rjk  (k  =  1,2)  given  by  (44)  is 


)2  .  (P2,e  -  P2,oY 
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We  have  reached  the  exponential  form  given  by  (45)  for  the  approximate  upper  bound  on 
the  average  cost  by  using  the  above  thresholds.  Minimizing  this  exponential  form  with  respect 
to  (<7i,</2)  is  equivalent  to  maximizing  the  exponent 


n(n  i,e(gi)  -  Mi,o(5i)£  ^2,0(02)  ~  /* 2,o(02))2 


*1,0(01) 


*2,0(02) 


=  e 2 


[{ViAsi)  - /U,o(0i))/0]2  .  [{H2A92)  -  1*2,0(92)) /9]2 


+ 


(46) 


*i,o(0i)  '  *2,0(02)  j 

where  the  weak  signal  6  — ►  0.  Using  the  regularity  condition  (36)  and  normalizing  (46)  by 
dividing  it  by  02,  we  have  the  equivalent  objective  function  for  design  criterion 


,  {dEg[gi]/dd}j=0  ,  {dEM/dO} Lo 
J  [91,92)  = - ■  2  '/"\ - + - ZT7Z 1 - 

*1,0(01)  *2,0(02) 


(47) 


which  is  to  be  maximized  with  respect  to  (<h,<jf2)-  We  notice  that  (47)  is  the  summation  of  two 
efficacies,  and  thus  the  criterion  becomes  a  generalization  of  the  ARE  in  single-sensor  detection 
(see  [9]).  In  fact,  we  have  shown  the  connection  between  the  general  ARE  and  the  average  cost 
for  the  detection  of  a  weak  signal  in  a  multiple-sensor  environment.  Because  the  observation 
sequences  of  the  two  sensors  are  independent  of  each  other,  we  can  maximize  the  efficacy  for 
each  sensor  in  (47)  with  respect  to  its  gk.  Notice  that  the  efficacy  for  each  sensor  in  (47)  is 
invariant  under  the  scaling  of  its  gk-  Using  the  same  technique  as  in  [9]  we  have  the  following 
integral  equation 

/OO 

Kk[x,  y)9k(y)dy  =  gk(x )  (48) 

-OO 

for  optimal  gk,  where  the  integration  kernel  Kk[x,y )  has  the  form 


Kk(x,y)  =  2  Jlf$+l/Nl(y\x)-  (2m  +  l)/fc(0)»  k  =  1>2. 
i- 1 


(49) 


The  procedure  of  solving  (48)  was  addressed  in  [9]  and  will  not  be  duplicated  here. 
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III.B.  Dependence  Across  Time  and  Sensors 

We  consider  now  the  case  when  the  two  stationary  observation  sequences  are  dependent 
across  time  and  sensors.  The  situation  of  interest  here  is  that  the  univariate  and  second-order 
joint  densities  of  the  two  observation  sequences  are  identical  and  the  structures  (i.e.,  the  non- 
linearities  and  thresholds)  of  the  two  sensors  are  the  same.  In  our  formulations,  we  use  the 
notations  / 1(-)  =  /2(-)  =  /(•),  •)  =  /j2)( *,  •)  =  fj(;  •)»  M')  =  j'a(-)  =  ff(')  and  %  =  tj2  =  rj. 

Consequently,  hi,o  =  H2,o  =  Ho,  H  1,6  —  H2,6  —  Ho,  and  <7ito  =  <72,0  =  cr0.  These  conditions  char¬ 
acterize  a  model,  for  which  the  channels  of  the  two  sensors  are  symmetric  and  the  two  sensors 
are  implemented  under  the  consideration  of  regulation  with  the  same  structure. 

Under  the  above  symmetric  conditions,  the  bound  described  by  (29)  has  the  form 


Bu  -  p{2v^7rci  exp[— ]  +  V2nc2 


2 °l  J  '  ^  (1  +  poK 


+(1  - „)(2v^C> expl-  ■”-(y  >  ]  +  ^expl  )? 

Using  the  same  technique  of  balance  condition  as  in  the  previous  case 


(50) 


po)2  _  -n(rj  -  He)2 

2(7^ 

-n(ri-no)2 

(1  +  Po)<Tq 


2<?l 

(1  +  poK  ’ 


(51) 

(52) 


we  obtain  one  solution  for  ?? 


T]  = 


Ho  +  Hb 


(53) 


Again  conditions  (18)-(21)  and  (28)  are  satisfied  with  this  threshold.  The  bound  defined  from 
this  threshold  is 


*  - 2^'  - +  ■ 


(54) 
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The  globally  optimal  nonlinearity  that  minimizes  (54)  is  difficult  to  obtain  and  may  even  not 
exist.  Therefore,  we  design  the  nonlinearity  in  the  ad  hoc  way  described  as  follows. 


We  notice  that  there  are  two  exponents  in  (54),  namely 

n  ^  n(Ms)  -  Po(ff))2  _„2  [(/**($) -/*>(tf))/0]2  „n2{[dEe[g)/dO\g=l 0}2 

- - 2^J) - -  ne  - 8^) - =  nt  - £*) -  (65> 

and 

T?  ( a \  n^e{9)  -  Mo (g))2  _  (p  KMff)  ~  Mo(g))/^]2  _  .2  {[dE[g]/de]8=0}2 

2{9)  (l  +  Po(9)K(g)  4(l  +  po(5)K2(5)  "  *(l  +  Po(g)W{9)  (  j 

as  6  — *  0.  First  we  obtain  the  optimal  gR ,(•)  by  maximizing  the  functional  i?i(-)  with  respect  to 
g(-).  Then  we  maximize  R2(-)  to  obtain  the  optimal  gji2(-).  From  gRl(-)  and  gRi{-)  we  compose 
the  nonlinearity  of  each  sensor  as 

g(x)  =  rwgRl(x)  +  (1  -  rw)gn2(x ),  (57) 

where  the  parameter  rt0  is  determined  through  numerical  experimentation.  The  optimization  of 
jRi(-)  has  been  discussed  in  Subsection  III.A.  For  the  optimization  of  i22(-)>  we  can  derive  an 
integral  equation  satisfied  by  the  optimal  nonlinearity  <?r2(-)  in  a  manner  similar  to  that  in  [9]. 

Notice  that,  for  all  a  ^  0,  R2(g )  =  R2(ag),  i.e.,  R2{-)  is  invariant  under  the  scaling  of  its  <?(•). 
Therefore  under  the  regularity  condition  (37),  maximizing  R2{-)  with  respect  to  g{-)  is  equivalent 
to  maximizing  —  / g{x)f'(x)dx  under  the  constraint  that  (1  +  /9o(<7))0o(s)is  a  constant.  Thus 
the  optimization  of  R2(g)  is  characterized  by  max3  H(g)  with 

H(g)  =  {-  J g(x)fo{x)dx  +  x[i  +  po(g)}v%(g)}  (58) 

where  A  is  the  Lagrange  multiplier  and 

M4(.s)  =  E„  { |r"'1^’,*‘°1  •  ) 

m 

=  ^[ff(*i1))^(*i2))]  +  2  -  (2m  +  l)Po  (59) 

j=i 
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by  using  a  large  sample  size. 


Denote  the  variation  of  <?(•)  by  Sg(-)  and  let  Jg(e)  —  H(g  +  eSg).  Then  the  necessary  and 


sufficient  conditions  satisfied  by  the  optimal  g  for  arbitrary  6g  is 


J'(0)  =  0 


(60) 


and 


J"9{ 0)  <  0.  (61) 

Let  /  (i)  (j)  be  the  joint  density  of  and  TVj U  and  /  (i)  <2)  the  joint  density  of  and 
We  have 

J£(0)= +  2\[g(x)f(x)  +  J  [23(/j(a:,  y )  +  fj(x,  y ))  -  2f(x)f{y)\dy]}6g(x)dx  (62) 


where 


fj{x,y)  =  (fN(>)'N«)(x,y)  + fNwyN(2)(y,x))/2 

+fN(1)>Nw(x’y)  +  -  2/(®)/(y) 


and 


/j(z,  v)  =  /N(  1)  N0)  (x,  y )  +  /«o)  »o)  (w,  *)  -  2 /(*)/(y). 

JV3  ’iV/+l  iVl  >+3 

From  (60)  and  (61),  we  have  the  linear  integral  equation 

/OO  771 

ff(y)(]C  /i(*> »)  +  /;(*>  »))<*»!  = 0 

■°°  j=i 


where  A  is  a  scaling  factor  of  5.  From  (61)  we  obtain  the  sufficient  condition 


2A(1  +  po(6g))<ro(6g)  <  0. 


(63) 


(64) 


(65) 


(66) 


Since  />o(-)  <  1?  (66)  holds  for  negative  A.  Following  the  tradition  in  single-sensor  detection  (see 
[9])  for  i.i.d.  observations,  we  set  A  =  —1/2;  thus  the  integral  equation  to  be  solved  for  the 
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optimal  g  can  be  expressed  as 


/oo 

Kc(x,  y)g{y)dy  =  g(x), 

-OO 

where  g/0(x)  =  —f'(x)/f(x)  is  the  locally-optimum  detector  for  i.i.d.  observations 
and  sensors  and 


I<c{x,y)  =  (fNw/Nw(y\x)+ fNo)/N(2)(y\x))/2-2f(y) 

m 

+  D4j;>  /nMx)  + 

+/NW  /N(i)(2/k)  +  fNw  /Nw{y\x)  -  4 f(y)] 

m 

f /M(1)(y\X')  ~  2/(2/)  ~b  2  ^  ][fp.r(l)^  / ^\r( 1 ) (3/ 1  ^ )  ~b  4(3,  ^(1)(^l*c)  — 


is  the  kernel  of  integration. 


(67) 
across  time 


2/(2/)]  (68) 
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IV.  STRONG  SIGNALS 


In  this  section  we  consider  the  second  description  of  observations,  which  is  characterized 
by  the  univariate  and  second-order  joint  densities.  The  distributed  detection  for  this  type  of 
observations  can  be  described  as  the  following  binary  hypotheses  testing  problem: 


r(fc)  .  y.(fc) 


Xiik)  ~  [A,a(Jfpl),/g(V,!i),xW)] 


H[k)  :  Xt{k) 


Xi{k)  ~  [hAxlk))jlj}(X?\xlk))],  i  =  k  =  1,2 


where  fjk\-)  is  the  univariate  density  of  the  observations  for  sensor  k  under  Hi  (/  =  0, 1)  and  the 
two  observation  sequences  ( k  =  1,2)  are  modeled  by  the  m-dependent,  the  </>-mixing 

or  the  p-mixing  sequence. 

Although  the  use  of  the  Chernoff  bound  simplifies  the  analysis,  the  determination  of  the 
optimal  thresholds  involves  complex  computations  and  results  in  nonlinear  optimization  for 
the  optimal  nonlinearities.  We  pose  additional  conditions  for  the  relationship  between  rjk  and 
(pk,i iak,i)  (J  =  0, 1)  and  express  rjk  (k  =  1,2)  as  functions  of  (p/t,t,cr|i().  Under  such  conditions 
for  our  design  criteria,  a  large  sample  size  is  used  to  guarantee  high-quality  performance. 

IV. A.  Dependence  Across  Time 

In  this  case,  we  assume  that  the  two  observation  sequences  are  independent  across  sensors 
conditioned  on  either  hypothesis;  thus  there  is  no  correlation  between  the  two  statistics  Tn<\  and 
Tn,2i  i-e.,  pi  =  0  for  /  =  0, 1.  By  using  the  approximation  (40),  we  have,  for  this  case, 


pA\  exp 


j-A2?i  - 


Ml,o(gl))2  ,  (??2  ~  ^2,o(ff2))2 

0  /  \  »  O  /  \ 


IM) 


°io  (92) 


+(1  -  p)Ai  exp 


(m  -Pi,i(gi))2  ,  (m-ji2A{g2))‘ 


hOn) 


*2,1(92) 


where  A\  and  A2  are  known  constants. 
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For  the  same  reason  as  in  the  previous  section,  we  pose  the  balance  condition 


for  each  sensor  k. 
are 


(Vk  ~  gfc,o(gfc))2  (Vk  ~  VkASk))2 


(71) 


°io(9k)  **,i(gfc) 

Under  the  assumption  ^k,i(gk)  >  Mfc,o(gfc)>  the  solutions  of  (71)  for  (r?i,  rft) 


m  = 


°kAgk)vk,o(gk)  +  g*:,o(fffc)Mfc,i(flfc) 

*fc,l(5fc)  +  Vkfl(gk) 


(72) 


for  k  =  1,2.  The  conditions  (18)-(21)  and  (28)  are  automatically  satisfied.  Although  these 
thresholds  are  suboptimal,  they  depend  only  on  (nk,ii^k,i)i  (k  —  1,2  and  i  —  0,1),  which  can 
be  calculated  directly  from  the  observations.  Consequently,  the  bound  given  by  (70)  takes  the 
form 


Bu  =  A1exp{-A2N[Je(gx,g2)]} 

[(Mi,i(gi) 


=  A\  exp  <  -A2N 


{- 


■/*l,o(gl))2  t  (^2,l(g2)  ~  M2,o(ff2))2 


(*l,l(gl)  +  *l,o(gl))2  (*2,l(g2)  +  *2,o(g2))2J 


(73) 


where  A\  and  A2  are  constants. 

The  minimization  of  the  above  objective  function  (i.e.,  the  maximization  of  Je )  with  respect 
to  gk,  for  k  =  1,2,  results  in  nonlinear  integral  equations,  which  are  difficult  to  solve.  To  simplify 
the  design  procedure  further,  we  approximate  Je  in  (73)  as  follows 


^e(gl  1  52) 


(gi,i(gi)  -  gi,o(gi))2 


+ 


0k2,l(g2)  -  M2,o(g2))2 


71*1, l(gi)  +  (1  -  7i)*i,o(gi)  72*2,1(52)  +  (1  -  72)^2,0(52) 


=  «^y(5l»  52, 7l»72) 


(74) 


where  the  denominators  of  the  two  terms  for  the  two  sensors  are  weighted  sums  of  cr|  2  and  a\  0, 
for  k  —  1,2,  with  weights  7 k-  This  form  of  design  criterion  has  been  useful  for  single-sensor 
detection  (see  [12]).  The  weights  7 k  are  chosen  to  maximize  </(gii7),  </272),  where  glnk  {k  =  1,2) 
are  the  optimum  nonlinearities  for  fixed  7 k- 
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Since  there  is  no  correlation  between  the  two  sensors,  maximizing  J^guSi)  (here  the  argu¬ 
ments  7^  are  omitted)  with  respect  to  (^1,52)  for  fixed  7*  is  equivalent  to  maximizing 


Jk{(Jk)  = 


0*fc,i(5fc)  -  Mfc,o(5fc))2 

+  (1  -  7k)<rlo (9k) 


with  respect  to  5 k  ( k  =  1,2),  respectively.  We  notice  that  J k  ( k  =  1,2)  are  invariant  under  the 
scaling  of  5^,  i.e.,  Jk(gk)  =  Jk(ctkgk)  with  the  scaling  factors  ak,  and  thus  we  can  use  the  same 
technique  as  in  the  previous  section  (see  [9])  for  the  optimizations  of  Jk.  The  optimizations  of 
J*,  for  each  sensor  is  equivalent  to  the  maximizations  of  the  objective  function 


Hk(dk)  =  MW (9k)  ~  /tfc,o(5fe)  +  *k[lk<7k,i(9k)  +  (1  -  lk)ol,\{9k)\ 


with  respect  to  5*,  where  A k  is  the  Lagrange  multiplier. 

Performing  the  optimization,  for  k  =  1,2,  we  obtain  the  following  linear  integral  equations 

fk,\{x)  -  fk,o(x)  -  J  Kk(x,  y)9k{y)dy  =  gk(x)[ykfk,i(x)  +  (1  -  7fc)/fc,o(»)]  (77) 

where 

m 

I(k{x,y)  =  (x,y)  -  2/M(*)/M(y))  +  (1  -  ik){f$(x,y)  -  2 fk>o(x)fk,o{y))] 

3= 1 

—lkfk,l{x)fk,l{y)  -  (1  -  lh)fk,o{x)fk,o{y) 


is  the  kernel  of  integration  with  the  notation 


v) = =  *)• 


Solving  each  integral  equations  for  fixed  yk  and  k  =  1,2  we  obtain  the  optimum  nonlinearity, 
which  is  a  function  of  7*.  It  is  straightforward  to  show  that 

7fccrfc,i(5i)  +  (1  -  lk)crlfi{9k)  =  lkvarkjl(gk)  +  (1  -  7fc)varfc>0(5fc)  +  J  J  Kk{ x,  y)gk(x)gk(y)dxdy 
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where 


Kk(x,  y)  =  Kk(x,  y )  +  ykfk,i(x)fk,i(y )  +  (1  -  lk)fk,o(x)fk,o(y)  (80) 

for  k  —  1,2,  and  thus  for  which  are  the  solutions  of  (77) 

+  (1  -  lk)alfi(gink)  =  J (fk,i(x)  -  fk,o(x))gllk(x)dx.  (81) 

Finally,  the  optimal  jk  ( k  =  1,2)  are  obtained  by  maximizing 

Jk(9ks yj  =  J (fk,i(x)  ~  fkfl{x))glrik(x)dx.  (82) 

IV.B.  Dependence  Across  Time  and  Sensors 

We  consider  the  case  for  which  the  observations  are  dependent  across  time  and  sensors. 
We  are  particularly  interested  in  symmetric  case  similar  to  the  one  in  Subsection  III.B,  i.e., 

/i,»(*)  =  hAx)  =  /<(*) 


conditioned  on  Hi  ( i  =  0, 1)  and  t)\  =  -  g.  Again,  the  same  form  of  the  nonlinearity 

ffi(-)  =  52(0  =  5(0  is  used.  Therefore,  pi,,  =  p2,i  =  Vi  and  a i,j  =  o2yi  =  <7,-,  for  i  =  0, 1.  In  the 
present  case,  the  upper  bound  given  by  (29)  takes  the  form 

*■  = 

+  v^(l-p){2c,exp  [-■' "-iy 1  ]  +c,exp  ]}  (84) 

which  consists  of  two  parts  from  the  two  hypotheses.  Instead  of  optimizing  the  above  bound 
directly,  we  proceed  in  an  ad  hoc  way  to  determine  the  nonlinearity.  Basically  there  are  two 
different  forms  of  exponents  in  Bu,  namely 

BU1  =-  V2np2ci  exp[  "^2^  ]  +  v/27r(l  -  p)2cx  exp[  )  (85) 
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and 


BU2  =  \f2irpc2  exp[  -■  y?]  -]  +  c2exp[— ' ~  j~] 


(86) 


(i  +  goK  Fl(i  +  PiW 

As  discussed  in  the  case  of  a  weak  signal,  we  first  simplify  the  problem  by  posing  the  balance 


conditions 


for  BUl  and 


(g~Mg))2  (Mg)  -  g)2 

2^o  (g)  2^(5) 


(v-  Mg))2  _  Mg)  -  g)2 

(1  +  Po(g)>o(g)  (1  +  Mg)W(g) 


(87) 


(88) 


for  BU2,  such  that  the  threshold  can  be  set  for  the  design  criterion.  The  solutions  of  (87)  and 


(88)  are 


fri(g)Mg)  +  go(g)/*i(g) 
Mg)  +  Mg) 


(89) 


and 

„  =  il +  Mg))M(g)Mg)  +  (i  +  Mg))*Mg)Mg)  f90, 

(1  +  gi(g))^i(g)  +  (1  +  Po(g))^o(g) 

respectively.  Again  conditions  (18)-(21)  and  (28)  are  satisfied  with  both  choices  of  threshold. 
We  notice  that,  by  using  the  above  thresholds,  the  exponents  in  BUl  and  BU2  essentially  take 


the  form 


and 


(Mg)  ~Mg))2 
Mg)  +  Mg))2 


_ Mg)  -  Mg))2 _ 

[(1  +  Mg))*  Mg)  +  (1  +  po(g))*Mg)]2 


(92) 


Since  the  optimization  of  either  (91)  or  (92)  with  respect  to  g(-)  leads  to  nonlinear  integral 


equations,  we  use  approximations  similar  to  the  one  in  Subsection  IV. A  In  summary,  the  criterion 


for  optimum  nonlinearity  </(•)  are 


7  _  Mg) -Mg))2 

a{9)  7<M(g)  +  (l-7a)Mg) 


(93) 
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and 

Mg)  = 

Then,  for  fixed  7a  and  7 &,  we  maximize  Ja  and  «/(,  with  respect  to  <?(•)  independently  to  obtain 
two  different  forms  nonlinearities  ga(-)  and  gb(-),  respectively;  the  respective  7a  and  74,  are  chosen 
by  maximizing  J(g*la )  and  Finally,  the  nonlinearity  employed  by  the  two  sensors  is 

9*(x)  =  rsgl(x)  +  (1  -  Ts)gt{x)  (95) 

where  the  parameter  ts  is  set  by  numerical  tests. 

The  maximization  of  (93)  with  respect  to  g  and  ja  has  been  discussed  in  Subsection  IV.A. 
The  linear  integral  equation  satisfied  by  the  optimum  <7*  is 

fi(x)  ~  fo(x)  ~  J  K(x,y)g*(y)dy  =  $C(®)[7«/i(*)  +  (1  -  7«)/o(*)]  (96) 

where 

m 

K(x,y)  =  ^2{ia[h,j(x^y)  -  2h(x)My)}  +  (i  -  7«)[/oj(*,y)  -  2/0(as)/0(if)]} 

3= 1 

-7a/l(z)/l(y)  -  (1  -  7a)/o(*)/o(y)  (97) 

is  the  kernel  of  integration  and 

fi,j(xi  V)  ~  Mx  1  =  xi+ 1  =  V)  +  fi(x 1  =  Vi  xj+ 1  =  x )•  (98) 

Because  Jb  is  also  invariant  under  the  scaling  of  g  [Jb(sg)  =  Jb(g )  for  constant  s],  equivalently, 
we  maximize  the  following  form 

Hb(g)  =  Ml (g)  -  MO (g)  +  A[7fc(l  +  Pi(g))<rl(g)  +  (l  -  7*)(l  +  Po(g))<To(g)\  (99) 

with  respect  to  g  for  fixed  7 where  A  is  the  Lagrange  multiplier.  Using  the  same  technique  as 
in  Subsection  IV.A,  we  obtain  the  following  linear  integral  equation  satisfied  by  the  optimum  gl 

f\(x)  -  /o(*)  -  J [ Kc{x,y )  +  Kc{x,y)\gt{y)dy  =  [76/1  (®)  +  (1  -  7t)/o(*)K(*)  (100) 


_ (Mi(ff)-Mo(g))2 _ 

76(1  +  pi(g))oftg)  +  (1  -  7fc)(l  +  Po(9))°o(9)' 
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where 


m 

Kc(x,y )  =  2  y)  -  /i(*)/i(y)]  +  (1  -  7b)[fo\x,y)  -  fo(x)fa(y)}} 

3= 1 

—~tbf\(x)fi(y)  -  (1  -  ib)Mx)fo(y)  (101) 

is  the  kernel  of  integration  and 

Kc(x,y)  =  ib[fi0)(x,y)  -  M*)h(y)]  +  (l  -  ib)[fo°\x,  y)  -  fo(x)f0(y)] 

m 

+2^2{'yb[fi\x,y)  -  fi{x)fi{y)]  +  (1  -  7 b)[fo\x,y)  -  fo(x)f0(y)}}  (102) 
j=i 

with  •)  the  joint  density  of  xj1^  and  across  sensors. 
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V.  NUMERICAL  RESULTS 


To  support  the  analysis  for  the  proposed  scheme  in  previous  sections,  we  consider  the  p- 
mixing  Cauchy  noise  for  the  weak  signal  model  characterized  in  Section  III,  and  the  observations 
of  p-mixing  Rayleigh  densities  (under  Hq )  vs.  the  ones  of  p-mixing  log-normal  densities  (under 
Hi)  for  the  strong  signal  model  characterized  in  Section  IV.  The  ms  are  truncated  to  values  which 
are  large  enough  for  the  simulations  in  different  examples.  The  sample  size  of  the  simulations 
is  100000  for  weak  signals  and  1000  for  strong  signals. 

V.A.  Weak  Signals 

We  first  consider  a  weak  signal  in  additive  Cauchy  noise  which  is  symmetric  with  respect  to 
its  medium.  The  performance  by  using  the  optimal  nonlinearities  is  compared  with  the  one  by 
ignoring  the  dependence  across  time  and/or  sensors  for  the  two  cases  given  in  Section  III. 

Without  loss  of  generality,  the  the  weak  signal  9  is  assumed  to  be  0.06  for  convenience.  The 
univariate  density  of  the  Cauchy  noise  in  our  numerical  examples  is  given  by 


fk(xW)  = 


tt[1  +  (xW  -  7fe)2] 


,  k  =  1,2 


where  —  oo  <  7*  <  00  is  the  median  (assumed  known). 

Although  the  second-order  joint  density  of  a  Cauchy  noise  is  difficult  to  characterize  directly, 
it  can  be  calculated  from  the  second-order  joint  density  of  a  Gaussian  process  by  a  nonlinear 
transformation  (see  [6]).  Let 

/G(x(*))  =  exp[-(xW-7fc)2/2] 
y/2ir 


and 


/G(*(fc),j/(fc)) 


Ml  -  PG^)  2(1  —  PqI) 


exp{- 


—  1 


25 


be  the  univariate  and  the  second-order  joint  densities  of  the  underlying  Gaussian  process,  where 
Pq^  is  the  correlation  coefficient.  Then  the  nonlinear  transformation  mentioned  above  has  the 
form  T(x)  =  tan[7r  •  erf(x/\/2)/2]  (see  [9]).  In  addition,  we  assume  that  the  underlining  Gaus¬ 
sian  process  for  the  corresponding  Cauchy  noise  of  each  sensor  is  characterized  the  following 
autoregressive  model  with  the  correlation  coefficients  -1  <  Pk  <  1  for  k  =  1,2 


N(k]  _  y(k) 

*f]  =  PkNlH  +  y/l  -piv}k);  i  >  1 


where  both  V[k\  for  i  =  1  , 2, •••,«;  k  —  1,2,  are  sequences  of  i.i.d.  Gaussian  random  variables 
and  have  standard  Gaussian  densities.  In  the  following  examples,  for  the  cases  of  dependence 
across  time  only  (Examples  1  and  3),  V^\  for  i  =  1,2,  •  •  •,»  are  generated  independently 
of  each  other;  for  the  cases  of  dependence  across  time  and  sensors  (Examples  2  and  4),  they  are 
generated  dependently  as  follows 


v/2) = Pcvp + 

where  and  Wt  are  two  independent  i.i.d.  Gaussian  processes  and  —  1  <  pc  <  1  is  another 
correlation  parameter. 

In  the  following  examples,  we  give  the  plots  of  the  optimal  nonlinearities  and  figures  of  the 
receiver  operating  characteristic  (ROC)  type  for  the  above  weak  signal  in  additive  noise.  In  the 
figures  of  ROC,  the  two  types  of  average  cost,  Eq  and  Ei,  are  normalized  with  respect  to  p  and 
1  —  p,  respectively. 

Example  1:  This  example  is  for  the  case  of  dependence  across  time  only.  The  parameters  of 
the  above  noise  model  are  set  to  be  71  =  0.0,  p\  —  0.95  and  m  =  150  for  sensor  1;  72  =  0.50, 
p2  =  0.90  and  m  =  100  for  sensor  2.  The  costs  are  chosen  as  ci  =  c[  =  1.0  and  C2  =  c'2  = 
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0.50.  The  optimal  nonlinearities  gk,opt(x)  (k  =  1,2),  and  the  ones  obtained  by  ignoring  the 
dependence  gktiid(%)  —  ~fk(x)/fk(x)  (k  =  1,2),  are  given  in  Figures  1.1  and  1.2  for  the  two 
sensors,  respectively.  The  average  costs  resulted  by  employing  these  two  forms  of  nonlinearities 
are  shown  in  Figure  1.3  for  sensor  1  under  two  fixed  thresholds  of  sensor  2  which  give  the  best 
performances  of  sensor  2  (at  their  individual  crossing  points  in  Figure  1.4  for  two  distinct  ROCs), 
and  in  Figure  1.4  for  sensor  2  for  two  fixed  thresholds  of  sensor  1  which  give  the  best  performance 
of  sensor  1  (at  their  individual  crossing  points  in  Figures  1.3  for  two  distinct  ROCs).  In  Figures 
1.3  and  1.4  the  solid  lines  represent  the  ROCs  of  employing  the  gk,opt  (k  —  1,2)  and  the  dot- 
dash  lines  represent  the  ROCs  of  employing  the  gk,iid  (k  =  1,2).  From  Figures  1.3  and  1.4  we 
conclude  that  the  performance  obtained  by  employing  gk,ad  is  better  than  the  one  obtained  by 
ignoring  the  dependence  across  time  for  this  specific  example. 

Example  2:  This  example  is  for  the  scheme  in  the  case  of  Subsection  III.B.  Here  we  set 
7i  =  72  =  0.0,  pi  =  p2  =  0.95  and  m  =  200  for  both  sensors,  and  the  correlation  parameter 
across  time  is  set  to  be  pc  =  —0.90.  After  numerical  tests,  we  obtain  rw  =  0.0225  in  Eq.  (57). 
In  Figure  2.1,  we  draw  the  nonlinearity  g%  given  in  Eq.  (57),  which  is  used  by  both  sensors. 
In  that  figure,  the  optimal  nonlinearities  obtained  by  ignoring  the  dependence  across  sensors 
QUd  (represented  by  dash  lines),  as  well  as  across  time  and  sensors  gad  (represented  by  dot-dash 
lines)  are  also  included.  Notice  that  g*(x )  =  gidd{x)  in  Eq.  (57).  In  Figure  2.2,  we  give  the 
plots  of  the  average  costs  E0/p  and  E\/(l  -  p)  (ROC)  as  a  function  of  the  threshold  used  by 
both  sensors.  The  solid  lines  represent  the  ROC  of  employing  the  gopt  =  rwgidd  +  (1  -  rw)gl,  the 
dash  lines  represent  the  ROC  of  employing  the  gidd ,  and  the  dot-dash  lines  represent  the  ROC 
of  employing  the  gad.  From  Figure  2.2  we  conclude  the  performance  obtained  by  employing  gopt 
is  the  best  for  this  specific  example. 
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V.B.  Strong  Signals 

Similarly,  we  now  show  the  performance  for  the  strong  signal  model  described  in  Section 
IV.  The  two  observations  hypotheses  under  considerations  are  characterized  by  the  stationary 
log-normal  densities  (univariate  and  second-order  joint)  under  H\,  and  the  stationary  Rayleigh 
densities  (univariate  and  second-order  joint)  under  Hq.  The  log-normal  processes  (i  = 
1,2,  •  ••;&  =  1,2)  are  obtained  from  the  nonlinear  transformations  xj^  =  exp[ivj^  -f  fi^p]  in 
which  are  generated  as  follows 


jy(l)  _  y(*) 
JV1,1  -  Kl,l 


<i’  =  i  >  1 


where  pi^  are  correlation  coefficients  and  Vjj?  (i  =  1,2, -*-,n)  for  sensor  k  (k  =  1,2)  is  a 
sequence  of  i.i.d.  Gaussian  random  variables  and  has  Gaussian  density  Af(l,  (tf^o©)6)-  ln  the 
case  of  dependence  across  time  only  and  are  generated  independently;  in  the  case  of 
dependence  across  time  and  sensors,  they  are  generated  dependency  as 

Vff  =  +  y/i  -  el,wu 


where  V"/ ^  and  W\ti  are  two  independent  i.i.d.  Gaussian  processes  and  —  1  <  pVii  <  1  is  an  cor¬ 
relation  parameter.  The  Rayleigh  processes  under  Ho ,  Xq*\  are  generated  by  two  independent 
Gaussian  processes  and  Z^J  (i  =  1,2 ,•••;£  =  1,2)  as  X^J  =  \Z(Y0[^)2  +  (Z^J)2  where 
y07  and  Zq^  are  generated  as  follows 


i  >  1 


and 


^0,1  — 


Tj(k) 

u0,l 
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=  PojkZ$-i  +  ~  plXh  * >  1 


where  p0,fc  are  correlation  coefficients  and  for  sensor  k  (k  =  1,2)  UqJ  (i  =  1,2,  •••)  are 

two  i.i.d.  Gaussian  sequences  with  the  same  density  •hT(f,&c,i)'  In  the  case  of  dependence  across 
time  only  (vj1^,  Vq^)  are  generated  independently,  so  is  the  pair  (UqJ,  UqJ))',  in  the  case  of 
dependence  across  time  and  sensors,  they  are  generated  dependently  as 


vj?  =  PvfiVoJ  +  \J  1  -  pl,0Wo,i 

and 

U$>  =  PvXJ  +  y/l-PloWo,i 

where  W0,i  is  another  i.i.d.  Gaussian  sequence  with  the  density  Af(r,  (cr^)6)  and  -1  <  pv<0  <  1 
is  an  correlation  parameter;  uJJ,  and  W0,i  are  mutually  independent. 

Example  3:  This  example  is  for  the  case  of  dependence  across  time  only  characterized  in 
Subsection  IV.A.  The  ms  are  set  to  be  300  for  both  sensors  under  the  two  hypotheses.  For 
sensor  1,  the  parameter  values  are  set  to  be  <7V£  ^  =  2.0  and  p\yo  =  0.926  under  Ho  and 
frj,1)  =  0.242,  =  0.798  and  p\p  =  0.992  under  Hi;  for  sensor  2,  a  $  =  2.20  and  p2,o  =  0.90 

under  H0,  and  a{JJ  =  0.242,  =  0.893  and  p2, i  =  0.990.  After  numerical  tests  for  the 

optimization  search,  we  obtain  7i  =  72  =  0.9231  in  Eqs.  (75)-(77)  and  (79)-(82).  In  Figures 
3.1  and  3.2,  the  two  optimal  nonlinearities  gk<opt  ( k  =  1,2)  for  the  strong  signal  model  are  given 
for  the  two  sensors;  there  the  nonlinearities  gk,iid(x)  =  ln[/k,i(x)//fc,o(a;)]  (k  =  1,2)  are  also 
described  by  dot-dash  lines.  The  ROCs  similar  to  the  ones  in  Example  1  are  drawn  in  Figures 
3.3  and  3.4  for  the  two  sensors,  respectively.  In  Figures  3.3  and  3.4,  the  distinct  ROCs  are 
represented  by  lines  similar  to  the  ones  in  Example  1.  The  ROCs  of  employing  gk,h  (k  =  1,2) 
which  are  obtained  by  using  71  =  72  =  0.5  are  also  included  in  Figures  3.3  and  3.4  (represented 
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by  dotted  lines).  From  the  figures  of  ROCs  we  conclude  that  the  performance  obtained  by 
employing  the  gk,opt  of  this  strong  signal  model  is  the  best  for  this  specific  example. 

Example  4:  This  example  is  for  the  case  of  dependence  across  time  and  sensors  characterized 
in  Subsection  IV. B.  The  parameter  values  are  the  same  as  those  of  sensor  1  in  Example  3  except 
that  m  =  600.  We  also  set  pVi0  =  0.90  and  pVy\  =  0.99.  By  using  numerical  tests,  we  set 
7a  =  7;,  =  0.9231  in  Subsection  IV. B,  and  rs  =  0.95  in  Eq.  (95).  The  nonlinearity  of  this 
strong  signal  model  which  is  used  in  Eq.  (95)  and  is  shown  in  Figure  4.1;  there  the  nonlinerities 
obtained  by  ignoring  the  dependence  across  sensors  gidd  (represented  by  dash  lines),  and  across 
time  and  sensors  gad  (represented  by  dot-dash  lines)  are  also  included.  The  ROCs  similar  to 
the  ones  in  Example  2  are  drawn  in  Figure  4.2,  where  distinct  ROCs  are  represented  by  lines 
similar  to  those  in  Example  2.  Again  in  Figure  4.2,  we  include  the  ROCs  of  employing  which 
is  obtained  by  using  7a  =  76  =  0.5  (represented  by  dotted  lines).  From  the  ROCs  we  conclude 
that  the  performance  obtained  by  employing  gopt  =  Twgidd  +  (1  -  rw)gl  is  better  than  the  one 
obtained  by  ignoring  the  dependence  across  time,  or  across  time  and  sensors  for  this  specific 
example. 
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VI.  CONCLUSIONS 


In  this  paper,  we  extended  the  scheme  of  memoryless  single-sensor  detection  to  distributed 
detection  for  two  types  of  signal:  the  weak  and  the  strong  singles  which  have  stationary  univari¬ 
ate  and  second-order  densities.  The  model  of  dependence  used  in  this  paper  for  the  correlation 
of  noise  (across  time  and/or  sensors)  is  characterized  by  one  of  the  mixing  sequences:  the  m- 
dependent,  the  ^-mixing  and  the  p-mixing  sequences.  The  performance  is  measured  by  the 
average  cost  which  couples  the  sensor  decisions  together  and  consists  of  the  error  probabilities 
involved  in  distributed  detection.  By  using  the  two-dimensional  Chernoff  bounds  on  the  error 
probabilities  and  some  approximation  techniques,  we  derived  the  design  criteria  for  the  memo¬ 
ryless  nonlinearities  of  the  sensors,  which  are  the  generalizations  of  the  AREs  in  corresponding 
single-sensor  detection.  Optimization  of  the  design  criteria  for  the  nonlinearities  in  different 
cases  led  to  linear  integral  equations. 

The  numerical  results  from  simulations  indicate  that  in  performance,  the  proposed  schemes 
by  employing  optimal  memoryless  nonlinearities  which  consider  the  dependence  (across  time 
and/or  sensors)  overmatch  the  corresponding  ones  by  ignoring  that  dependence  for  different 
cases  of  observation  descriptions  (weak  signal  and  strong  signal)  and  dependence  (dependence 
across  time  only  and  dependence  across  time  and  sensors). 

Although  we  focused  on  two-sensor  detection,  the  schemes  and  the  techniques  which  are  used 
in  this  work  can  be  applied  to  multi-sensor  detection  in  which  frequently  the  continuous-time 
optimization  methods  are  necessary. 
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Fig.  1.1  The  Nonlinearities  of  Sensor  1  in  Example 
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Fig.  1.2  The  Nonlinearities  of  Sensor  2  in  Example 
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Fig.  2.1  The  Nonlinearities  of  the  Two  Sensors  in  Example 
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Fig.  3.1  The  Nonlinearities  of  Sensor  1  in  Example 
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Fig.  3.3  ROCs  for  Example  3  under  a  Fixed  Threshold  of  Sensor 
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